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This invention relates to an hsr (hypersensitivity-related) gene family and individual components thereof 
including its promoter and regulatory regions thereof, its coding region, its gene product; modifications thereto; 
applications of said gene, promoter region, regulatory region and coding region and modifications thereto; DNA 
constructs, vectors and transformed plants each comprising the gene or part thereof. 

5 The hypersensitive reaction (HR) of higher plants is a local inducible response associated with disease 

resistance to a pathogen. This response is characterized by a rapid and localized necrosis of tissues invaded 
by an incompatible (avirulent or non-host) pathogen, which prevents further spread of the invading microor- 
ganism. Several defense genes whose products may intervene in this plant response have been extensively 
studied: they include enzymes of the phenylpropanoid pathway involved in the synthesis of antimicrobial phy- 

10 toalexins, enzymes with hydrolytic activities, toxic compounds and cell wali proteins. In infected plants, these 
genes are induced around the necrosis, once it has developed, i.e. late during the HR. Moreover, most of them 
are also strongly expressed during compatible interactions leading to the disease of the plant, and for some 
of them, during the normal development of the plant. The lack of specificity of these defense genes as well as 
their activation in the late steps of the HR suggest that they may not account by itself for establishment of the 

15 complex inducible response that is the HR, but rather may accompany this reaction. To date, the molecular 
mechanisms leading from plant-pathogen recognition to development of the HR are not known. In the "gene 
for gene" hypothesis, the initial step of plant-pathogen recognition leading to resistance involves the putative 
interaction between the products of a plant resistance gene and of the corresponding pathogen avirulent gene. 
Genetic studies indeed revealed that the outcome of many plant-pathogen interactions are determined through 

20 single dominant genes in both partners. Several rapid physiological changes have been also associated with 
the HR, such as electrolyte leakage, changes in respiration rates and more recently oxidative cross-linking of 
cell-wall proteins. However, in no case has a plant gene been described whose activation is specific or at least 
preferential during the resistance reaction, and precedes the development of the HR. 

It is known that Pseudomonas solanacearum, a vascular bacterium, causes a lethal wilting of different 

25 plant species including Solanaceae. In this bacterium, a hypersensitive response (hrp) and pathogenicity gene 
cluster has been shown to control both the ability to elicit the HR on non-host plants and to cause the disease 
on host plants. In particular, hrp gene mutants of P. solanacearum have lost the ability to elicit an HR on to- 
bacco plants. Recently, it was established that the hrpN gene of the hrp gene cluster of another bacterial path- 
ogen, Erwinia amylovora, encodes a proteinaceous HR elicitor called harpin. This result confirms the important 

30 role of hrp genes in eliciting the HR. Upon infiltration of tobacco leaves by an HR-inducing incompatible isolate, 
six different gene families were characterized which are activated early during the interaction, before any nec- 
rosis of the leaf was detected. These genes which were not induced upon infiltration by an hrp-isolate differed 
by the accumulation levels of their transcripts during the incompatible versus the compatible interactions: the 
str (sensitivity-related) genes are expressed to a similar extent in both types of interactions, whereas the hsr 

35 genes are activated preferentially during the HR. 

The present invention relates to an hsr gene family represented by a gene, hereinafter designated hsr203J, 
the sequence of which is depicted in SEQ ID No. 1. The putative protein product (SEQ ID No. 2) of the gen 
exhibits little, if any substantial homology with known proteins. Tests employing i.a. the promoter region of the 
hsr203J structural gene operably linked to a reporter gene in transient gene expression assays and in trans- 

40 genie plants indicate that the expression of the hsr203J gene is closely related to the development of hyper- 
sensitivity: the promoter is specifically activated during the HR several hours before the appearance of th 
necrosis, and the localization of its activation is restricted to cells inoculated with an incompatible bacterial 
isolate. 

According to the present invention there is provided a recombinant DNA sequence including a region com- 
45 prising the nucleotide sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant 
sequence comprising a part of said region or said equivalent. 

Hereinafter where the term "functional equivalent" is used in respect of the protein encoding region of th 
DNA sequence the term means the said region wherein one or more codons have been replaced by their syn- 
onyms, ie codons which specify a corresponding amino acid or a corresponding transcription termination sig- 
50 nal. 

Where the term "functional equivalent" is used in respect of transcriptional regulatory regions of the se- 
quence the term means the said region wherein one or mor nucl otides have been replaced by different nu- 
cleotides and/or the region wherein on or more nucleotides have been added or removed with the proviso 
that the thus produced quivalents retain transcriptional regulatory activity and exhibit substantial homology 
55 with the region, or part ther of, which is 5' to the above mentioned protein encoding r gion. 

As used herein, th term "substantial homology" ref rs to a DNA sequence which hybridizes under con- 
ventional hybridization conditions with a ref rence sequence. Preferably the hybridization conditions refer to 
hybridization in which the TM value is between 35 and 45°C. Most preferably the term substantial homology 
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refers to a DNA sequenc which hybridizes with the reference sequence under stringent conditions (as defined 
below). 

The term "regulatory region" as used herein refers to the nucleotide region in the sequence depicted in 
SEQ ID No. 1 which is 5' to the protein encoding region in the sequence. The regulatory region thus includes 

5 the promoter of the hsr203J gene and the functional components of the promoter which affect transcription. 
Such functional components include a "deletion promoter" and transcriptional "silencers" and "enhancers". 

A "deletion promoter" within the context of the present invention is any hsr203J derived promoter which 
has a deletion relative to the natural promoter and which still retains promoter activity. Such promoter activity 
may be enhanced or substantially the same when compared to the native promoter. The skilled man is aware 

10 of the manner in which deletion promoters can be assayed for retention of their promoter activity. Deletion pro- 
moters according to the present invention are inducible, inter alia, by plant pathogens, and find utility in con- 
structs comprising structural genes providing for improved disease resistance. 

Where the term "functional equivalent" is used in connection with a protein, the sequence of which is dic- 
tated by at least a part of the DNA sequence depicted in SEQ ID No. 1, the term means a protein having a like 

15 function and like or improved specific activity, and a similar amino acid sequence. The present invention in- 
cludes pure proteins which have an amino acid sequence which is at least 60% similar to the sequence or part 
(see below) thereof of the protein depicted in SEQ ID No. 2. It is preferred that the degree of similarity is at 
least 60%, more preferred that the degree of similarity is at least 70% and still more preferred that the degree 
of similarity is at least 80%. 

20 In the context of the present invention, two amino acid sequences with at least 60% similarity to each other 

are defined by having at least 70% identical or similar amino acids residues in the same position when aligned 
optimally allowing for up to 4 deletions or up to 10 additions. For the purpose of the present invention: 
Alanine, Serine and Threonine are similar; 
Glutamic acid and Aspartic acid are similar; 
25 Asparagine and Glutamine are similar; 

Arginine and Lysine are similar, 
Isoleucine, Leucine, Methionine and Valine are similar; 
Phenylalanine, Tyrosine and Tryptophan are similar. 
Where the term "part" is used in connection with a protein sequence, the term means a peptide comprised 
30 by the sequence depicted in SEQ ID No. 2 and having at least 5 amino acids. More preferably the peptide has 
* at least 20 amino acids, and still more preferably the peptide has at least 40 amino acids. 

Where the term "part" is used in connection with a nucleotide sequence, the term means a nucleotide se- 
quence comprised by the sequence depicted in SEQ ID No. 1 and having at least 15 nucleotides. More pre- 
ferably the part has at least 25 nucleotides, and still more preferably the part has at least 40 nucleotides. 
35 The invention also includes a recombinant DNA sequence including a region comprising nucleotides 1413 

to 241 7 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequenc 
comprising a part of said region or said equivalent. Nucleotides 1413 to 2417 correspond to the protein-en- 
coding region of the hsr203J gene which is useful in that the gene product has a functional role in regulating 
or providing for disease resistance in plants. Thus, the protein coding sequence, or a part thereof, of th 
40 hsr203J gene may be fused to an inducible promoter such as that regulating expression of WIN, WUN or PR- 
proteins so that upon infection by a compatible pathogen, expression of the hsr203J structural gene is induced. 
The ensuing activation of the hypersensitive response by the hsr203 J protein in infected plant cells halts further 
spread of the pathogen. 

The invention also includes a recombinant DNA sequence including a region comprising nucleotides 1 to 
45 1341 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequenc 
comprising a part of said region or said equivalent. Nucleotides 1 to 1341 correspond to the non-protein en- 
coding region of the sequence which is 5' to the said protein encoding region. The region of the said DNA s - 
quence comprising nucleotides 1 to 1341 includes the transcriptional regulatory region of the hsr203J gene, 
including the promoter (binding site for RNA polymerase) and transcriptional silencers and enhancers. 
so Silencer and enhancer elements are useful in that they enable modulation of the level of expression of 

the structural genes under their control. 

The inv ntion still further includes a recombinant DNA sequence including a r gion comprising nucleotides 
1 to 651 of the sequenc depicted in SEQ ID No. 1 or a functional equivalent thereof, or a r combinant sequence 
comprising a part of said region or said equivalent. The region comprising nucleotid s 1 to 651 includes a tran- 
55 scriptional silencer. 

The invention still further includes a recombinant DNA sequence including a region comprising nucleotides 
652 to 1341 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant 
sequence comprising a part of said region or said equivalent. The region comprising nucleotides 652 to 1341 
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includes a transcriptional enhancer and the promoter (ie RNA polymerase binding site) of the hsr203J gene. 

The invention further provides the use of hsr203J promoter sequences as affinity substrates for the iden- 
tification and subsequent purification of hsr203j promoter binding proteins (hsr-PBP's) and proteins associated 
with these hsr-PBP's. Such hsr-PBP's have been partially characterized, are probably present constitutively 
5 and may bind to hsr203 J promoter sequences upon incompatible reaction of the host plant such as occurs when 
Nicotiana tabacum L is inoculated with specific strains of Pseudomonas solanacearum. 

The invention still further includes a recombinant DNA sequence including a region comprising nucleotides 
1195 to 1341 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or recombinant se- 
quence comprising a part of said region or said equivalent. The region comprising nucleotides 1195 to 1341 
10 includes a bacterial response element which is capable of binding to specific proteins which are produced by 
pathogens during their infection of tissue, and which are implicated in the development of the hypersensitive 
response (see above). 

The invention still further includes a recombinant DNA sequence including a region comprising nucleotides 
1195 to 1268 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant 
15 sequence comprising a part of said region or said equivalent. This region more precisely defines the bacterial 
response element. 

The invention still further includes a recombinant DNA sequence as disclosed above wherein the said re- 
gion, part or equivalent thereof is located on the 5' side of, and is operably linked to, a protein-encoding se- 
quence of a heterologous gene or to a sequence comprising nucleotides 1 4 1 3 to 241 7 of the sequence depicted 
20 in SEQ ID No. 1 or a functional equivalent thereof. It is particularly preferred that a translation enhancing se- 
quence is present between the region or part or equivalent thereof, and the protein-encoding region of the DNA 
sequence 3' thereto. 

The heterologous gene may be any suitable structural gene, including a selectable or screenable marker 
gene or a gene, the product of which is capable of conferring resistance or tolerance to at least one of the fol- 
25 lowing: insects, herbicides, fungi, bacteria and viruses, a marker gene for use in disease pressure forecasting 
and anti-feedant genes. 

The promoter, and/or regulatory regions of the hsr203J gene may be fused to a structural gene encoding 
a non-diffusible cytotoxic gene product such as an ribonuclease, protease, lipase or glucanase. Induction of 
expression of such structural genes provides a rapid and localized response to infection by pathogens, and 

30 may be useful in providing resistance or improving tolerance of the plant to the pathogen. 

Moreover, the regulatory regions of hsr203J gene may be used in the creation of "detector 4 * plants enabling 
the early detection of disease pressure. The hsr203J promoter and/or regulatory regions thereof, may be fused 
to a nucleotide sequence providing for a visual alteration to the host, plant phenotype upon activation of the 
promoter by infection. Such sequences include the anti-sense orientation of the gene encoding the Small Sub- 

35 unit of Ribulose B-phospho Carboxylase (SS-RUBISCO) which causes localized bleaching of green tissues. 
Such sequences could also encode a gene encoding a key enzyme in pigment biosynthesis such as chalcon 
synthase. 

The invention also includes recombinant DNA according to the invention, which is modified in that codons 
which are preferred by the organism into which the recombinant DNA is to be inserted are used so that ex- 
40 pression of the thus modified DNA in the said organism yields substantially similar protein to that obtained by 
expression of the unmodified recombinant DNA in the organism in which the protein-encoding components of 
the recombinant DNA are endogenous. 

The invention still further includes a DNAsequence which is complementary to one which, under stringent 
conditions, hybridizes to any one of the above disclosed recombinant DNA sequences. 
45 "Stringent hybridization conditions" are those in which hybridization is effected at between 50 and 60°C 

in 2X saline citrate buffer containing 0.1%SDS followed by merely rinsing at the same temperature but in a 
buffer having a reduced SCC concentration which will not affect the hybridizations that have taken place. Such 
reduced concentration buffers are respectively (a) 1xSCC f 0.1%SDS; or (b) O.SxSCC, 0.1%SDS; or (c) 
O.lxSCC, 0.1%SDS. 

50 The invention still further includes a DNA vector comprising a recombinant DNA sequence according to 
the invention or a DNA sequence which is complementary to one which, under stringent conditions, hybridizes 
thereto. 

It is preferred that the vector according to the inv ntion be used to transform a eukaryotic host, preferably 
of plant origin. It will be appreciated that suitable micro-organisms may b transformed with such a vector, and 
55 such micro-organisms r present yet a further embodiment of the invention. 

The term "plant" is used herein in a wide sense and refers to differentiated plants as well as undifferen- 
tiated plant material such as protoplasts, plants cells, seeds, plantlets etc. that under appropriate conditions 
can develop into matur plants, the progeny thereof and parts thereof such as cuttings and fruits of such 
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plants. 

Preferred vectors will of course vary depending on the chosen host. For dicotyledons, the vector may be 
introduced into a protoplast by contacting the vector with the protoplast in a suitable medium and under ap- 
propriate conditions which render the protoplast competent for the uptake of DNA; the vector may also be em- 

5 ployed in the form of an Agrobacterium tumefaciens Ti-plasmid derivative which infects plant cells or proto- 
plasts. Monocotyledons are preferably transformed by micro-injection, electroporation or by use of the micro- 
projectile gun, using the so-called ballistic technique. In any case, appropriate transformation vectors and pro- 
tocols are well known in the art. The transformed cells or protoplasts are cultured in an appropriate culture 
medium, and a transformed plant is regenerated in a manner known perse. The introduced nuclear material 

10 is stably incorporated into the genome of the regenerated transformed plants which accordingly express the 
desired genes. 

Examples of genetically modified plants according to the present invention include: fruits, including tom- 
atoes, peppers, mangoes, peaches, apples, pears, strawberries, bananas, and melons; field crops such as 
canola, sunflower, tobacco, sugar beet, small grain cereals such as wheat, barley and rice, corn and cotton, 
15 and vegetables such as potato, carrot, lettuce, Brassica oleracea such as cabbage and onion. The particularly 
preferred plants are sugar beet and corn. 

The invention still further includes the progeny or seeds of such plants, and the seeds and progeny of 
said progeny. 

The invention still further includes protein obtained by expression of the recombinant DNA according to 
20 the invention, and in particular, expressed protein having the amino acid sequence depicted in SEQ ID No. 2, 
or a part thereof or a functional equivalent of said sequence or part. 

The Invention will be further apparent from the following description, and the associated Figures and Se- 
quence Listings. 

25 Of the Figures : 

FIG. 1 shows a chimeric construct used for transient gene expression assays in tobacco protoplasts and 
for transformation of tobacco plants via Agrobacterium tumefaciens. Fig (A) shows the restriction map of the 
chimeric p-glucuronidase gene on pHG21 (or pHG21 A). This gene consists of a translational fusion between 
30 1 .4 kb of the 5' flanking sequence from the hsr203J gene and the coding region of uidA gene linked to the no- 
paline synthase gene polyadenytation signal (nos T). Fig (B) shows the sequence (SEQ ID No. 2) of the pHG21 
translational fusion joint. The hsr203J gene sequence is in bold type and the uidA sequence is in standard 
type. The orientation is 5' to 3', and the arrow in the Figure indicates the position of the fusion between the 
sequences. 

35 FIG. 2 shows the effect of infection with different isolates (hrp, K 60 and GM1 1 000) of Pseudomonas sol- 

anacearum on hsr203J promoter activity in transformed tobacco protoplasts. As a control, water was added 
to the protoplasts. Plasmids pBI201 and pBI221 are respectively negative and positive control plasmids; 
pHG21 is the hsr203J-uidAgene fusion. GUS activity assays were performed 24 h after incubation. The data 
shown represent the mean of three separate experiments. 

40 FIG. 3 shows the time course of hsr203J promoter activation of the GUS gene in transgenic tobacco leaves 

infiltrated with different isolates (hrp, K 60 and GM1 1000) of P. solanacearum. GUS activity was measured in 
extracts of four leaves from two pHG21-14A transformants. 

FIG. 4 shows a quantitative analysis of GUS activity in locally bacterial-infected transgenic tobacco plants. 
Figure (A) shows the induction of ^-glucuronidase activity in the inoculated third leaf, and in the upper and 

45 lower un-inoculated leaves. Figure (B) shows the induction of p-glucuronidase activity in and around the lesion 
of the inoculated third leaf. The following tissue samples were assayed: lesion meaning necrotic tissue resulting 
from the wounding and/or bacterial infection; 0-3mm meaning apparently healthy tissue up to 3mm from th 
lesion; 3-6mm meaning apparently healthy tissue 3 to 6mm surrounding the lesion. Inoculation was performed 
on pHG21-14A transformants. Small leaf perforations were covered by a droplet of the bacterial suspension 

so (3 p,L containing 10 8 c.f.u./mL) or water, as indicated on the Figure. Tissue samples were collected 18 h after 
inoculation. 

FIG. 5 shows the effect of hrp mutants on the activation of hsr203J promoter in transgenic pHG21(14A) 
tobacco plants. Figure (A) shows localization of hrp mutations in the different transcription units of th hrp gene 
cluster. Figur (B) shows measurements of GUS activity in leaves at 18 h after inoculation by th hrp K60 or 
55 GMI 1000 isolates or by water, or by th hrp mutants indicated in Figure (A). Inoculation was performed as 
described for Figure 4. 

FIG. 6 shows sch matically the construction of plasmids pHGD having several deletions of pHG21. 
FIG. 7 shows in transg nic tobacco plants th expr ssion of the GUS gene by constructs obtained by 
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5* promoter deletions of pHG21 (according to the scheme of Figure 5). The plants were transformed with 5 \xg 
DNA, and the value 100 was given to the GUS activity obtained by transformation with the pHG21 construct. 
The Figure shows the increase in activity (after 18 hours) of the GUS gene as a consequence of infiltration 
of the transformed plants with the bacterial strains Delta 3, K60 and GM1 1000. As controls plants were infil- 
5 trated with water. 

Of the Sequences : 

SEQ ID No. I shows the nucleotide sequence of the hsr203J gene, including the protein encoding region 
w and promoter and transcriptional regulatory elements therefor, isolated from tobacco. The protein coding re- 
gion of the gene is comprised by nucleotides 1413 to 2417 in the sequence. Putative polyadenylation signals 
are present 3' to the protein coding region of the gene and the sequence responsible for the HR is within about 
1.4 kb of the 5' non-coding region of the gene. In essence the sequence comprises: 
a) a 72 bp mRNA leader sequence, located at nucleotides 1341 to 1412 inclusive; 
75 b) CAAT and TATA consensus sequences located at nucleotide positions 1282-1286 and 1313-1316 re- 

spectively; 

c) the translation start site codon at nucleotide positions 1413-1415; 

d) the "deletion promoter" sequence located at nucleotides 1-1341 inclusive which is substantially respon- 
sible for the promoter activity; 

20 e) the sequence located at nucleotide positions 1195-1268 having an enhancing effect on the promoter 

activity; 

f) the sequence located at nucleotide positions 1-651 having a silencing effect on the promoter activity. 
SEQ ID No. 2 shows the translation product of the hsr203J structural gene, encoded by nucleotides 1413- 
2417 in SEQ ID No. 1; 

25 SEQ ID No. 3 shows a linker region for a chimeric gene comprising the 5* flanking region of the hsr203J 

structural gene and the coding region of the uidA reporter gene. The start codon for the hsr203J structural 
gene is at nucleotides 10-12 in the sequence and nucleotides 13-64 encode the N-terminal sequence of the 
hsr203J gene product. 

30 Bacterial Strains and Plant Material 

The source of the Pseudomonas solanacearum strains used herein is depicted in Table 1 . 
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TABLE 1 



10 



15 



20 



Pseudomonas solanacearum wild type and mutant strains used in this study, 
and their ability to induce symptoms on tobacco ^ 

S^. ains Source or reference Isolated from Tobacco response 



Wild type 

GMI 1000 Boucher et aL (1) Tomato mk 

K60 Lozano et aL (17) Tomato Disease 

Mutants derived from GMI 1000 (deletion of hrp gene cluster) 
^rp Boucher et aL unpublished No symptoms 

Mutants in hrp gene cluster derived from GMI 1000 (Tn5-B20 mutagenesis) 
GMI 1462, 1475, 

1494, 1492, 1487 Arlat et aL (18) No symptoms 

GMI 1423, 1425 Arlat et aL (18) Partial and/or 

delayed HR 

25 Mutant derived from GMI 1000 (Tn5-B20 mutagenesis outside the hrp gene cluster) 
GMI 1485 Arlat et aL (18) HR 



30 

The GM1 1 000 and K60 isolates are wi Id-type P. solanacearum strains, the former induces the development 
of an HR on tobacco leaves within 24 h after infiltration, and the latter causes the typical lethal wilting disease. 
A derivative of the GM1 1000 isolate, called Ahrp, deleted for the hrp gene cluster, causes no apparent symptoms 

35 in inoculated leaves. Eight mutant strains derived from GMI1000 by transposon Tn5-B20 mutagenesis were 
used as described below. The GMI1462, 1475, 1494, 1485, 1423 and 1425 strains are each mutated in one 
of the six putative transcription units of the hrp gene cluster. All these strains have lost the ability to cause an 
HR on tobacco, except strains GMI1423 and 1425 which are mutated in the right-hand end of the hrp gene 
cluster, and induce only a partial and/or a delayed HR on tobacco; and the strain GM1 1485 which is mutated 

40 outside of the hrp gene cluster and elicits a normal HR on tobacco and constitutively expresses the structural 
gene of p-galactosidase. All these are grown at 28°C in B or BGT media (1). The cultivars of Nicotiana tabacum 
L. used herein: Bottom Special and Samsun, exhibit similar responses after bacterial inoculation. The seed- 
lings are grown in vitro on Murashige and Skoog (MS) medium (2) during 4 to 5 weeks (25° C ( 1 6 h photoperiod, 
15 Watt/m*), and then transferred to soil in a growth chamber (25°C, 16 h photoperiod, 30 Watt/m*). 

45 

Isolation of hsr203J gene, and nucleotide sequence analysis 

A tobacco (Nicotiana tabacum L cultivar NK326) genomic library constructed in the bacteriophage X- 
Embl3 (Clontech) is screened with the pNt203 cDNA clone (3). The Pstl insert of pNt203 is labeled by the ran- 

50 dom primer technique (4). Replicate nitro-cellulose filters of the genomic library are treated and hybridized as 
suggested by the manufacturers (Amersham). Four different genomic clones including hsr203J are isolated. 
Exonucleas III deletions are performed at both ends of DNA inserts sublconed in the phagemid pKS (Stra- 
tagen ) according to Henikoff (5), and both strands are sequenced by the dideoxy chain tenmination method 
(6) using Sequ nase (US Biochemical, Corp.). Sequence compilation and analysis are performed by using the 

55 Genetics Computer Gr up software of the University of Wisconsin (7). Homology searches with the Gen bank 
(release 71.0) and Swissprot (release 21.0) data bases are performed using the FASTA algorithm (8). The pro- 
tein sequences are analysed for potential N-terminal signal s quences and membrane-spanning domains us- 
ing release 5.0 of the PC/Gene Programme (Department of Medical Biochemistry, University of Geneva, Swit- 
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zerland). The transcription start site is determined by the primer extension technique using polyA+ RNA ex- 
tracted from tobacco leaves 9 hours after inoculation with the incompatible isolate and an oligonucleotide lo- 
cated at the ATG codon (nucleotides 1413 to 1415 in SEQ ID No. 1). 

5 Reporter gene constructs 

A 2.2 kilobase (kb) Bglll fragment containing 1.3 kb of the 5' non-coding region of the tobacco hsr203J 
gene and 890 base pairs (bp) of the nucleotide sequence downstream of the transcription start site is clon d 
into the BamHI site of phagemid pKS, to produce pKJ2.2. This plasmid is digested with BstBI, which cuts once 

10 55 bp 3' of the hsr203J translation initiation codon, and the BstBI generated ends were blunt end ligated by 
the Klenow fragment of DNA polymerase before digestion with Sail. This 1 .5 kb Sail - BstBI fragment is cloned 
into the Sail - Smal site of the p-glucuronidase (GUS) expression binary vector pBI101.2 (9) to produce the 
hsr203J- uidA gene fusion pHG21 A. A 3.5 kb Hindlll - EcoRI DNA fragment of pHG21A, including the hsr203J 
promoter and the uidA coding sequence, is ligated into the Hindlll - EcoRI digested pUC19 vector to produce 

15 pHG21 t for transient expression gene assays (Fig. 1). The pHG21 and pHG21 A constructs therefore contain 
1 341 bp 5' non coding sequence, the 72 bp leader sequence, the first 55 bp of the coding sequence of hsr203J 
fused in frame with the GUS coding sequence, and the nopaiine synthase (nos) gene pc yadenylation signal. 
The translational fusion is confirmed by direct double-stranded sequencing with a GUS specific primer (10). 
Two additional plasmids, pBI201 and pBI221 , contain respectively a promoterless uidA gene, and a cauliflower 

20 mosaic virus (CaMV) 35S promoter - uidA gene, upstream of the nos terminator, in the pUC19 vector (Clon- 
tech). 

Protoplast isolation and transient expression assays 

25 Leaves of 4 to 5-week-old in vitro grown tobacco plants, cultivar Samsun NN, are used for isolation of pro- 

toplasts by incubating leaf sections in TO medium (11) containing 1 g/Lcellulase R10 Onozuka, 200 mg/Lmac- 
erozyme Onozuka (Yakult Honsha, Nishinomiya, Japan) and 500 mg/L pectolyase Y23 (Seishin Pharmaceut- 
ical Ind.), for 15 h at 22° C in darkness. Protoplasts are separated from the cellular debris by sieving through 
an 85 |im nylon mesh followed by centrifugation at 50 g for 5 min onto a 1 ml_ cushion of 19 % (w/v) sucrose. 

30 Floated protoplasts are washed once with TO medium, counted, and adjusted to the density of 1.5 x 10 6 pro- 
toplasts/mL. Transformation is performed by incubating the protoplasts (320 u.L samples) at 45° C for 5 min, 
after a brief cooling at room temperature, by adding plasmid DNA (50 u.g per assay in 10 mM Tris-HCI, pH 8) 
and 160 \iL of a PEG solution (40 % PEG, 0.4 M.mannitol, 30 mM MgCI 2 , 0.1 % Mes pH 5.8). Protoplasts are 
gently mixed for 10 min at room temperature. They are then collected by centrifugation and resuspended in 

35 500 |iL TO medium. The bacterial suspension (10 bacteria/protoplast) prepared as previously described (12) 
is then added. After incubation at 28° C for 24 h, the protoplasts are lysed by the addition of 50 \lL of 1 0X GUS 
buffer, centrifuged and the supernatant is assayed for GUS activity (10). 

Transgenic tobacco plants 

40 

pHG2A, pBI121 ( and pBI101 are mobilised from Escherichia coli DH5a into Agrobacterium tumefaci ns 
strain LBA4404 (13) and transgenic tobacco plants (N. tabacum, Bottom Special) are generated by the leaf 
disk method (14). Transformed plants are selected on MS medium containing 0.8 % Difco agar, kanamycin at 
100 ng/mLand carbenicillin at 500 ng/mL. Transgenic plants are self-fertilized and seeds are collected. Their 
45 genotypes are determined by progeny (T2) analysis, by germination on MS medium containing kanamycin (500 
u.g/mL). 

Inoculation of transgenic plants with bacterial isolates 

so All the inoculation experiments are performed on kanmycin-resistant T2 plants, with at least 2 plants of 

the same genotype per experimental condition. For the screening of transformants and kinetic experiments, 
tobacco leaves are detached from 8 week-old plants and infiltrated in vacuo with the bacterial suspension (10 7 
c.f.u./mL) or water as described in ref (1 2). Syringe inf i Itration experiments are p rformed on 8 week-old plants 
by infiltrating the bacterial suspension (10 8 c.f.u./mL) into a small region of undetached leaves with a syringe 

55 without a needl . For some experiments, inoculations wer performed on 5 week-old plants grown in Magenta 
cubes (Sigma) on MS medium. Each half of a leaf is perforated 6 times with an 10 ^-Hamilton needle and a 
3 uL droplet of bacterial suspension (1 0 8 c.f.u./mL in 0.4 % Difco agar) is immediately deposited at the wounded 
sites. 
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K 

For localized root inoculation, 4 week-old plants are grown on a raft (Sigma) in contact with MS medium 
containing 0.2 % Difco agar, and inoculated with a 3 ul_ droplet of bacterial suspension through a wound made 
with a needle at one centimeter from the root apex or at a secondary root emergence. For generalized root 
inoculation, the whole plant is detached carefully from the raft, avoiding wounding, and the root system is im- 
5 mersed in 7 mL of the bacterial suspension (10 s c.f.u./mL). 

Inoculated plants are maintained at 28° C, and analysed either directly or stored at -80° C after incubation 
time. 

GUS assays 

10 

Plant tissue is ground in liquid nitrogen, homogenized in 1X GUS buffer, centrifuged for 5 min at 10,000g 
and the supernatant assayed for GUS activity, as previously described (15). Protein concentration is deter- 
mined using the Bradford dye reagent. GUS activity is expressed as picomoles of 4-methylumbelliferone per 
min per mg of protein. Alternatively, histochemical assays are performed on fresh tissue using X-gluc (5-bromo- 
15 4-chloro-3-indolyl-p-D-glucuronide, Clontech) or Magenta-gluc (Biosynth AG) as the substrate (1 0). For some 
experiments, samples are fixed in 0.3 % formaldehyde/ 50 mM NaP0 4 buffer pH7, then cleared by boiling in 
ethanol and stored in ethanol 70 %. 

ft-galactosidase assays 

20 

Following the GUS histochemical assay, some samples are equilibrated in Z' buffer (16) (100 mM NaP0 4 
buffer pH 7.4, 10 mM KCI, 1mM MgS0 4 ), fixed in 1.25 % glutaraldehyde for 1 h in order to inactivate endoge- 
nous plant p-galactosidases, rinsed and stained at 28° C with 0.8 mg/mL Magenta-Gal (Biosynth Ag) or X-gal 
in 2' buffer containing 5 mM K 3 FeCN) 6 and 5 mM K4Fe(CN) 6 , then cleared by boiling in ethanol and observed 
25 by dark- or bright-field microscopy. 

Characterization of hsr203J gene 

The hsr203J gene is isolated by screening a genomic tobacco library with pNt203 cDNA clone. It belongs 
30 to a small multi-gene family consisting of a minimum of 4 genes (see ref. 3) and at least 2 genes of this family 
corresponding to 2 different cDNA clones (pNt203 and pNt239) are expressed during the HR. 

Sequence analysis of the 2.7 kb DNA region of hsr203J (SEQ ID No. 1) reveals a single open reading frame 
(ORF) with no intron and a potential coding capacity of 355 amino acids. The nucleotide sequence of the said 
2.7 kb region Is identical to the pNt239 cDN A clone except for 2 substituted bp (not shown). These mismatches 
35 are probably due to the isolation of the genomic and cDNA clones from different tobacco cuttivars: the genomic 
clone is isolated from cultivar NK326 whereas the pNt239 cDNA clone is obtained from the cultivar Bottom 
Special. The predicted hsr203J structural protein (SEQ ID No 2) has a Mr. of 37.5 kDa and a theoretical iso- 
electric point of 5.17. 

The transcription start site is mapped by primer extension to a position 72 bp upstream of the putative 
40 translation initiation codon. The promoter and 5'-untranslated region exhibited no obvious sequence homology 
to cis-elements already described in defense genes. 

Transient expression of the hsr203J-uid A gene fusion in tobacco protoplasts 

45 pHG21 plasmid is composed of a translation fusion between 1 .4 kb of the 5* flanking sequence from th 

hsr203J gene and the coding region of the uidA reporter gene, linked to the 3' untranslated region of the no- 
paline synthase gene (Fig. 2). The plasmids pBI201 and pBI221 are used respectively as negative and positiv 
controls in transient assays. 

Initial experiments show that protoplast viability as quantified by Evans blue exclusion is not significantly 

so altered in the presence of bacteria at 10 to 100 bacteria per protoplast (data not shown). Subsequently experi- 
ments are performed with 10 bacteria per protoplast. At this bacterial density, the expression of GUS fused 
to the hsr203J promoter in response to GMI1000 isolate is 6-fold higher than in response to the controls (water 
orAhrp inoculation) (Fig. 3). In comparison, inoculation with the compatible isolate, K60, led to a 2-fold increas 
in enzyme activity. These levels of GUS activity have to be compared with those measured in protoplasts trans- 

55 formed with the CaMV 35S-uidAgene fusion (pBI221) which exhibit a high and almost constitutive level after 
the various inoculation treatm nts (Fig. 3). 

The results of transient assays therefore indicate clearly that the hsr203J promoter contains all the nec- 
essary elements for its preferential activation by the HR-inducing bacterial isolate, and that this expression 
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system perfectly mimics the plant/pathogen interaction. 

Expression of hsr203J-uid A gene fusion in transgenic tobacco 

5 In order to determine the spatial and temporal patterns of expression of the hsr203J promoter in planta, 

the hsr203J-uidA gene fusion is transferred to tobacco by leaf disk transformation. T2 plants resistant to ka- 
namycin are used in all the experiments. Of 23 kanamycin resistant transformants, 20 expressed the gene 
fusion and these all exhibit the same overall pattern of expression: GUS activity is found maximal after infil- 
tration with GMI1000, with a 2- to 90-fold stimulation over control infiltrations (water or Ahrp), and a 2- to 25- 
10 fold induction over K60 infiltration, 18 hours after inoculation (not shown). These levels are comparable to 
those obtained in transient experiments after inoculation by GM1 1000 or K60. 

Based on this analysis, a transformant (pHG21-14A) which displays a 90-fold stimulation of GUS activity 
after incompatible inoculation compared to control infiltrations, and contains one insertion of the gene fusion 
per haploid genome, is selected. The presence of a native gene fusion is checked by Southern analysis of gen- 
ts omic DNA (not shown). 

Assay of extractable GUS activity and GUS histochemical localization are both used to monitor the activity 
of the hsr203J promoter in different organs during plant development and in response to bacterial inoculation. 
No GUS activity was detected in 4, 7 or 15 day-old pHG21-14A tobacco seedlings, either in healthy leaves, or 
in flowers of fully grown plants (data not shown). These data indicate that the hsr203J promoter is strongly 
20 activated in leaves inoculated with the HR-inducing isolate, GMI1000, 18 h after infiltration, as indicated by 
the screening of all the transformants obtained. A kinetic study is performed on transformant pHG21-14A(Fig. 
3), which shows that in leaves infiltrated with GMI1000, GUS activity increases to a level 12-fold over control 
values 6 h after inoculation, reaches a maximum of 200-fold stimulation at 9 h, and decreases to an intermedi- 
ate level (80-fold induction) upon longer incubations. Much lower levels are measured after K60 infiltration, 
25 and undetectable levels of GUS activity were found in leaves infiltrated with water or the Ahrp isolate at any 
incubation time. 

Plants transformed with the promoterless construct pHI1 01 show negligible levels of GUS activity. More- 
over, plants transformed with pBI121, which contain a CaMV 35S-uid A gene fusion, show similar levels of 
enzyme activity, whatever the nature of the inoculum (not shown). Thus the hsr203J-uid A gene fusion exhibits 
30 a distinct and specific pattern of activation upon bacterial inoculation of transgenic tobacco plants that closely 
matches the in vivo pattern of accumulation of hsr203J transcripts in infiltrated tobacco leaves (3). These re- 
sults also indicate that hsr203J promoter is early and specifically activated during an incompatibl 
plant/pathogen interaction, and that its induction is hrp gene-dependent since the bacterial isolate which is 
deleted of hrp genes is unable to activate the hsr203J promoter. 

35 

Localization of hsr203J-uidA activation in response to bacterial inoculation 

Different inoculation tests are performed on transformants pHG21-14A in order to localize precisely 
hsr203J promoter activation in response to bacterial inoculation; first, in tobacco leaves in order to investigat 
40 promoter induction during a typical HR, and secondly, in roots, which are the organs naturally infected by th 
bacteria. 

Leaf inoculations : 

45 In order to test whether the hsr203J-GUS gene expression is local or systemic, leaves of 5 week-old trans- 

genic plants are inoculated with bacterial suspension droplets. After incubation for 18 and 70 hours, GUS ac- 
tivity is determined in half of the inoculated leaf as well as in upper and lower leaves. The results show a 15- 
fold induction of this activity in the inoculated leaf, whereas very low levels are detected in the lower and upper 
leaves (Figure 4A). The other half of the inoculated leaf is used for histochemical GUS assay. A narrow blue- 

so stained region is visualized 18 h and 70 h after inoculation with the HR-inducing bacterial isolate, surrounding 
the wounded area, which is restricted to a few cell layers and is localized very close to yellowing, probably 
dead, cells. Th intensity of the staining increases 70 h after inoculation. Only a few dispersed cells exhibit a 
faint blue staining after K60 inoculation; water or Ahrp isolate inoculations induce no detectable GUS expres- 
sion. Staining of transgenic plants harboring a chimeric uidAgene und rth control of the CaMV 35S promoter 

55 r suits in the staining of the entire I af, with no preferential staining around th lesions, thus demonstrating 
th sp cific nature of the induction of the hsr203J promoter in this area. A mor detailed localization of this 
activation during infection is provided by GUS activity measurements in small squares surrounding the lesion, 
1 8 h after inoculation (Fig. 4B). High levels of enzyme activity (48-fold stimulation over control values) are found 
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only within the necrotic lesion itself after inoculation by GMI1000. No detectable enzyme activity is found in 

tissue up to 3mm away from the lesion. 

In order to determine how early the hsr203J promoter is activated in the inoculated area, histochemical 

GUS localizations are performed on leaves of 8 week-old transgenic plants locally infiltrated by a syringe with 
5 the bacterial suspensions or K60. As early as 6 h after inoculation by the GMI1 000 isolate at which time there 

is no visible tissue necrosis, the leaf infiltrated area shows a blue staining whose intensity increases 9 h after 

inoculation. At later incubation time points, a yellow necrosis progressively appears, limited on its border by 

a thin blue area still located within the infiltrated part of the leaf. 

These different experiments show clearly that hsr203J-GUS expression is confined to a restricted area 
w corresponding precisely to cell layers infected by the HR-inducing isolate, GMI1000. 

Root inoculations : 

Roots of transgenic plants grown on rafts are wounded and inoculated with a droplet of bacterial suspen- 
ds sion. After 48 h incubation, histochemical localization of GUS activity is performed. Staining only observed in 
roots infected by GMI1000 extends from the initially inoculated site to a 2 mm distance in the root. Cytological 
studies indicate that hsr203J promoter activation appears not to be cell-type dependent (not shown). A gen- 
eralized root inoculation is also performed by simply immersing the whole root system in a bacterial suspen- 
sion. In this case, GUS activity is found in restricted regions of the roots, i.e. at the point of origin of secondary 
20 roots. Expression of the gene fusion at this specific location has to be correlated with the existence of prefer- 
ential sites of bacterial entry into the host which have been observed along the emergence sheath of secondary 
roots. At these specific sites, a double staining of GUS activity and bacteria by using a bacterial isolate con- 
taining a p-galactosidase fusion, shows a good correlation between the activation of the hsr203J promoter and 
the presence of bacteria. A superficial and intercellular bacterial colonization of the root tips has also been ob- 
25 served and results in a strong activation of the hsr203 promoter in this part of the root. 

Thus, the hsr203J-GUS gene fusion exhibits a distinct and specific pattern of activation in transgenic to- 
bacco plants in response to bacterial infection and one which closely matches the pattern of bacterial ingress 
into the plant. 

30 Dependence of hsr203J-uidA activation on hrp genes 

Different P. solanacearum strains mutated in one of the six transcription units of the hrp gene cluster (Fig. 
5A).are used to inoculate transgenic plants (pHG21-14A) by the droplet method. These mutant strains have 
lost the ability to induce an HR on tobacco, although two of them, GMI1425 and GMI1423, lead to a partial or 
35 delayed HR. 18 h after incubation, no effect on GUS activity can be detected with 6 out of 7 tested mutants; 
only GMI1423 leads to an increase in enzyme activity comparable to that of the wild type strain, GMI1 000 (Fig. 
5B). These data indicate that hsr203J activation requires almost a whole functional hrp gene cluster. 

Until now, no plant gene has been identified which is specifically implicated in the perception of an in- 
compatible pathogen, the transfer of that signal throughout the cell or finally the programmed cell death (HR) 
40 which provides an efficient mechanism for the limitation and eventual elimination of the pathogen. 

The gene hsr203J (SEQ ID No. 1) is the first hypersensitivity-related gene to be isolated, whose promoter 
exhibits a rapid, high-level localized and specific activation in response to an HR-inducing bacterial isolate. 

Construction of deletions of the 5' promoter region of pHG21 

45 

Unidirectional deletions of the promoter of the chimeric gene have been realized starting from the 5' nd 
according to Henikoff (5). For that purpose, plasmid pHG21 (Fig. 1) is linearised employing the restriction en- 
zymes Shpl and Sail, and then digested by exonuclease III. Constructions having successive deletions, each 
distant by ca. 200 pb, are selected. The localization of the 5' end of the deletion is determined by sequencing 
so the region and comparison with the nucleotide sequence of the hsr203J gene (see Figure 6). 

Effect of deletions on gene expression of the chimeric gene in transgenic tobacco plants 

50 u.g plasmid DNA corresponding to the different deletions (Fig. 6) are introduced by transformation into 
55 tobacco plants. Th GUS activity is m asured 18 hours after inoculation. 

FIG. 7 shows the expression of the GUS gen by constructs obtained by 5* promot r del tions of pHG21 
(according to the scheme of Figure 5). The plants were transformed with 5 \xg DNA, and the valu 100 was 
given to th GUS activity obtain d by transformation with the pHG21 construct. The Figure shows the increase 
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in activity (after 18 hours) of the GUS gene as a consequence of infiltration of the transformed plants with the 
bacterial strains Delta 3, K60 and GM1 1000. As controls plants were infiltrated with water. These experiments 
indicate the presence of 2 main regions having a regulatory effect of the deletion promoter of the hsr203J gene. 

One or more elements situated in the 1-651 nucleotide region of SEQ ID No. 1 are responsible for a di- 
5 minution of the expression of the chimeric gene, and elements situated in the second region (nucleotides 652- 
1268) exhibit a positive effect on the activation of the promoter of the hsr203J gene. 

The study of the spatial and temporal patterns of promoter activation in roots and leaves of transgenic 
plants inoculated with Pseudomonas solanacearum, indicate that 

- the promoter is specifically activated during the HR several hours before the appearance of the necrotic 
10 lesion 

- the localization of its activation is restricted to the few cell layers in contact with the bacteria 

- the promoter does not respond to various stress conditions and is very weakly activated during com- 
patible interactions 

- the promoter activation is strongly dependent on hrp (hypersensitive response and pathogenicity) genes 
15 of Pseudomonas solanacearum. These genes control the ability of the bacterium to elicit the HR in re- 
sistant or non-host plant and to cause the disease on the host plant. 

In favour of a major role of the bacterial hrp genes in the activation of hsr 203 J gene promoter, is the fact 
that the hsr 203 promoter is expressed in response to an HR specific elicitor, harpin, product of one of the hrp 
genes of Erwinia amylovora. In response to this polypeptide, the promoter is activated at similar levels to those 

20 observed with the corresponding avirulent strain, but more rapidly. Other potential inducers such as biotic and 
abiotic elicitors, resistance inducers, do not affect its expression. The generality of the specific expression of 
hsr 203J during incompatible interactions with bacterial pathogens has been demonstrated by testing other 
pathogens such as Pseudomonas syringaepv pisi / pseudomonas syringae pvtabacl, and Erwinia amylovora. 
In addition the functional analysis of the cis elements responsible for the transcriptional activation of the 

25 hsr 203J gene in response to the incompatible bacterial strain, has been initiated by generating a series of 
5' deletions and analysis of these constructs by transient assay and in transgenic plants. The results reveal 
the presence of a distal silencer element, and of two positive regulatory elements, one being quantitative (nu- 
cleotides 655-770 in SEQ ID No. 1), the other one being specific for the response to the bacterium, between 
nucleotides 1195 and 1268 of the SEQ depicted in SEQ ID No. 1. 

30 These results indicate that the hsr 203J gene promoter exhibits new and original characteristics of acti- 

vation with regard to plant defense genes studied so far; its spatial and temporal program of activation together 
with its specific induction during the HR underline the importance of this gene as a molecular tool to study 
the establishment and regulation of the HR. In addition, a 74 bp sequence element has been defined as re- 
sponsible for the inducibility of the promoter by the avirulent pathogen. 

35 Although the invention has been specifically described with reference to activation of the hsr203J promot r 

in response to challenge of Tobacco plants with an incompatible pathogen, it will be appreciated that the pro- 
moter may likewise be activated by challenge of other plants transgenic for the gene with other pathogens, 
including certain viruses and certain fungi, indicating that specific expression of the hsr203J promoter is a gen- 
eral phenomenon of incompatible interactions between host and pathogen which lead to the hypersensitive 

40 response. 

Moreover, the nucleotide sequence comprised by positions 1195 to 1268 of the sequence depicted in SEQ 
ID No. 1 containing the bacterial response element binds to nuclear protein extracts from various sources (heal- 
thy plants, plants inoculated with Pseudomonas solanacearum strains: compatible , incompatible and the hrp- 
mutant, after different incubation times). Such binding may be estimated by retardation gel analysis using, 
45 for example, the 74 bp region and several sub-fragments thus enabling identification of discrete sequences 
within the BRE region which are useful in providing genetic constructs comprising inducible disease resistanc 
genes. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Sandoz Ltd 

(B) STREET : Lichtstrasse 35 

(C) CITY: Basel 

(D) STATS: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : CH-4002 

(G) TELEPHONE: 061-324-2327 

(H) TELEFAX: 061-322-7532 

(I) TELEX: 965-050-55 

(A) NAME: Sandoz Patent GMBH 

(B) STREET: Humboltstrasse 3 

(C) CITY: Loerrach 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP): D-7850 

(A) NAME: Sandoz Erfindungen Verwaltur.gsgesellschaf t 

mbH 

(B) STREET: Brunner Strasse 59 

(C) CITY: Vienna 

<E) COUNTRY: Austria 

(F) POSTAL CODE (ZIP): A-1235. 

(ii) TITLE OF INVENTION: Hypersensitivity related gene 

(iii) NUMBER OF SEQUENCES: 3 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: genomic DNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tobacco 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(3) LOCATION: 1413.. 2417 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGATCTTAAT GTTAGTTTAT CTCTTGTTTT GAATATTTGA TCTTAATTAT AATTTATCCA 
CCATAAATTT TATTTTCAAA GATCAAACTA TTGATATGAC ATTTCACTTT TTTATCTTTA 
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5 



10 



15 



20 



25 



TGTTTGTAGA 


ATCATTAGTG 


GTATTGACTC 


TTACCAATCA 


TTTTTTTTTC 


TTTCTCACAC 


180 


ATTTATATTC 


TTAAATTTTC 


TTAGTTATTG 


TTTAATAATT 


GGGTATTTTT 


TAATATTACA 


240 


CGAAAAATTG 


ATTAAAAAAA 


TATTATTTGA 


GTAGAAAAAT 


AGTTCAAATA 


TAATATAAAC 


300 


ATATATTATC 


GTGGGAGTAT 


TTTTTTCTCA 


ATTTCAACTC 


TTTATGCAGT 


CCACTTAATA 


360 


TTACTTTTAT 


TTTTTCTTGG 


TATTAGACAT 


TATGGAGTGG 


TAATGTATTG 


CCAATACGGC 


420 


TGATTCTTAT 


GAAATTGATT 


TTATTAAACC 


TTCCTACATT 


TTTAATAATA 


ATTTAATAGA 


480 


CAAAATTTTA 


TTAATTTTAA 


ATATTAAATA 


TTAAAAATTA 


GTAGCATATA 


AGGTATTATA 


540 


GTCCAAAAAA 


TAGCTTATTA 


CAGTTACGTA 


CTCTTCCTAT 


GAGTTCTTTC 


GTTTAATAAT 


600 


GTAGGGCTAT 


TTTGATATAT 


TAATA7TGTA 


TTTATGCTTT 


TATAATAATA 


TAGGCTCTCT . 


660 


TTTTTC7ATA 


TGAATTTGGA 


CAATATAATA 


CAT7TTCAAA 


TTAAATTAGT 


ATCAAATAAT 


720 


TGTATTTTTG 


CTTTTTTAAT 


AATTTATACG 


CATGAATTTC 


ATAATCCAGC 


ATATTATGCT 


780 


AGAACTTTTC 


GTGTTTCAAC 


TAAAATAATG 


ACTATTTTTC 


AATGACGTTA 


CAAACACTGA 


840 


CTAATTTTTG 


ATTGCAGTCC 


GAAAACTATC 


TAGTCTATGC 


TATTTTCACT 


TTTCTAAACT 


900 


CCCTGCCACT 


GTATGCTTTC 


ATTGGATTAA 


CCTTTAACCA 


CACAAATATT 


TTAAAGAGTA 


960 


ATGTTTGACA 


GCGTAATTXG 


AAACATCTAC 


TATGCCTCTG 


TATATAATAT 


CTAATGTTTG 


1020 


TTCGTAGACC 


AATATTCTAA 


TTCCTCTCTT 


G T AG AC T AAA 


CGGGGCTGTA 


ACTAACTAAC 


108C 


CACCATAGTT 


ATCTAAATTA 


GTGACCCTAG 


CGACCATTGA 


TAATTTGATA 


CTGATCATTG 


114C 


ACTTCCACCA 


AATCTACTTT 


CTAAATGTGG 


ACTGACTCAT 


TATGAATTTG 


TGAGGAAAAT 


1200 


ACTTTCCTAA 


TGCTAGTGCT 


CTTCCCATTA 


TCTAAACTCC 


AAAATTTTGT 


AAAATTC7TT 


1260 


GAACCTTCCT 


TTAAACTACC 


ACAAATTTTC 


TTATCCTTTC 


CTATCTCACC 


ATTATAAATA 


1320 


GCCACGCACA 


TGCAAACCAA 


AGGTACACAC 


TAAACAAACT 


TCATTCTTCA 


AATTACTGAT 


1380 



GTT CAT GAA AAG CAA GTG 1433 
Val His Glu Lys Gin Val 
5 

TTC GAA GAC GGT TCA GTA 1481 
Phe Glu Asp Gly Ser Val 
20 

AAA TTC ATG GCC GAG CCA 1529 
Lys Phe Met Ala Glu Pro 
35 

GTT GCC GTC AAA GAT GTA 1577 
Val Ala Val Lys Asp Val 
50 55 

CGC ATC TAC TTA CCT GAA 1625 
Arg lie Tyr Leu Pro Glu 
70 

GTC ATT CTT CAC TTC CAA 1673 
Val He Leu His Phe Gin 
85 

TGG TTC ATG TAC TAC ACT 1721 
Trp Phe Met Tyr Tyr Thr 
100 



35 

TACTCGAAAA AAACACTTCA AACTTTGCCA AA ATG 

Met 
1 

ATA GAG GAA GTA TCC GGC TGG CTT AGA GTT 
He Glu Glu Val Ser Gly Trp Leu Arg Val 
40 10 15 

GAC CGG ACT TGG ACC GGT CCA CCC GAA GTC 
Asp Arg Thr Trp Thr Gly Pro Pro Glu Val 
25 30 



45 



50 



55 



GTC CCA CCC CAT GAC TAC TTC ATC GAC GGC 

Val Pro Pro His Asp Tyr Phe He Asp Gly 
40 45 

GTC GCC GAC GAA AAA TCC GGC AGC CGT CTC 

Val Ala Asp Glu Lys Ser Gly Ser Arg Leu 

60 65 

CGA AAC GAC AAT TCC GCC AGC AAG CTT CCC 

Arg Asn Asp Asn Ser Ala Ser Lys Leu Pro 
75 80 

GGC GGC GGC TTT TGT GTC AGC CAT GCT GAT 

Gly Gly Gly Phe Cys val Ser His Ala Asp 
90 95 
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GTC TAC ACG CGC CTA GCG CGC GCG GCC AAA GCT ATC ATT GTC TCC GTC 17 69 

Val Tyr Thr Arg Leu Ala Arg Ala Ala Lys Ala lie lie Val Ser Val 
105 110 115 

5 TTC CTC CCC CTC GCG CCG GAG CAC CGC CTC CCA GCT GCC TGC GAT GCC 1817 

?he I^eu Pro Leu Ala Pro Glu His Arg Leu Pro Ala Ala Cys Asp Ala 
120 125 130 135 

GGT TTC GCC GCT CTC CTC TGG CTC CGG GAC CTC TCC CGG CAG CAA GGA 18 65 

Gly Phe Ala Ala Leu Leu Trp Leu Arg Asp Leu Ser Arg Gin Gin Gly 
™ 140 145 150 

CAC GAG CCC TGG CTC AAC GAT TAC GCA GAT TTC AAC CGA GTA TTC CTC 1913 
His Glu Pro Trp Leu Asn Aso Tyr Ala Asp Phe Asn Arg Val Phe Leu • 
155 * 160 165 

)5 ATC GGA GAC AGC TCC GGC GGG AAC ATA GTC CAC CAA GTT GCC GTC AAA .19 61 

lie Gly Asp Ser Ser Gly Gly Asn He Val His Gin Val Ala Val Lys 
170 175 180 

GCC GGC GAG GAA AAC TTA TCT CCA ATG CGA CTG GCC GGC GCA ATT CCG 200 9 

Ala Gly Glu Glu Asn Leu Ser Pro Met Arg Leu Ala Gly Ala lie Pro 
2Q . 185 190 195 

ATC CAT CCA GGT TTC GTG CGG TCC '.TAT CGG AGC AAA TCG GAG CTA GAA 2057 
lie His Pro Gly Phe Val Arg Ser Tyr Arg Ser Lys Ser Glu Leu Glu 
200 205 210 215 

CAA GAG CAA ACC CCG TTT TTA ACA TTA GAT ATG GTG GAT AAA TTT CTA 2105 
25 Gin Glu Gin Thr Pro Phe Leu Thr Leu Asp Met Val Asp Lys Phe Leu 

220 225 230 

GGG TTA GCT TTA CCA GTA GGG AGC AAC AAG GAT CAT CAA ATA ACA TGT 2153 

Gly Leu Ala Leu Pro Val Gly Ser Asn Lys Asp His Gin He Thr Cys 

235 240 245 

30 

CCG ATG GGA GAG GCG GCG CCG GCA GTG GAG GAG CTT AAA TTA CCG CCT 2201 

Pro Met Gly Glu Ala Ala Pro Ala Val Glu Glu Leu Lys Leu Pro Pro 

25C 255 260 

TAT TTG TAC TGT GTG GCG GAG AAA GAT CTG ATA AAG GAC ACT GAA ATG 2249 
Tyr Leu Tyr Cys Val Ala Glu Lys Aso Leu He Lys Asp Thr Glu Met 
265 270 275 

GAG TTT TAC GAA GCT ATG AAA AAG GGG GAA AAG GAT GTA GAG CTG TTT 2297 
Glu Phe Tyr Glu Ala Met Lys Lys Gly Glu Lys Asp Val Glu Leu Phe 
280 285 290 295 

40 ATT AAC AAT GGA GTG GGA CAT AGC TTT TAT CTT AAC AAA ATT GCT GTT 2345 
He Asn Asn Gly Val Gly His Ser Phe Tyr Leu Asn Lys He Ala Val 
300 305 310 

AGA ATG GAC CCT GTA ACT GGT TCT GAA ACT GAA AAA CTT TAT GAA GCC 2393 
Arg Met Asp Pro Val Thr Gly Ser Glu Thr Glu Lys Leu Tyr Glu Ala 
315 320 325 



35 



45 



50 



GTT GCA GAG TTC ATC AAC AAG CAT TA AAAGGAGAAA ATTTGTGGTT 2439 
Val Ala Glu Phe lie Asn Lys His 
330 335 



55 



TTGCAGAATA 


TTTGTTTGTT 


GCATGCATGT 


TC AAG AT TTT 


GATGTACCGT 


CTTGATTGTC 


2499 


ACGTTCTAAT 


GGTTTTGTAA 


TTATAATTAT 


GAGGAGTAAA 


TTTCTATTGT 


TGCGTAGAAA 


2559 


TGTTTTTTCT 


TTGGTAGTAA 


ATGTTTATTT 


GTAATACTTT 


AAAAAGTGGA 


CAAATTTCTT 


2619 


TTGAGATTCA 


TGAAATAATA 


TCTTTAAATT 


TCGAATGTCA 


ATAAGTCCAG 


AAATTGAAAT 


2679 


GTATCTGTAC 


CGTCAATGAA 


GTCTCCTTGA 


GGCTTTTTTT 


CACATGATAT 


CGTCTATACC 


273 9 
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ACCAAAAAGT TTGATAAGCT ATACAATATG AGATTCTCG 2778 

(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

Met Val His Glu Lys Gin Val lie Glu Glu Val Ser Gly Trp Leu Arg 
15 10 15 

15 

Val Phe Glu Asp Gly Ser Val Asp Arg Thr Trp Thr Gly Pro Pro Glu 
20 25 30 

Val Lys Phe Met Ala Glu Pro Val Pro Pro His Asp Tyr Phe lie Asp 
35 40 45 

Gly Val Ala Val Lys Asp Val Val Ala Asp Glu Lys Ser Gly Ser Arg 
50 55 60 

Leu Arg lie Tyr Leu Pro Glu Arg Asn Asp Asn Ser Ala Ser Lys Leu 
65 70 75 80 

25 Pro Val He Leu His Phe Gin Gly Gly Gly Phe Cys Val Ser His Ala 

85 90 -95 

Asp Trp Phe Met Tyr Tvr Thr Val Tyr Thr Arg Leu Ala Arg Ala Ala 
100 " 105 110 



20 



30 



Lys Ala He He Val Ser Val Phe Leu Pro Leu Ala Pro Glu His Arg 
115 120 125 

Leu Pro Ala Ala Cys Asp Ala Gly Phe Ala Ala Leu Leu Trp Leu Arg 
130 ; 135 140 

Asp Leu Ser Arg Gin Gin Gly His Glu Pro Trp Leu Asn Asp Tyr Ala 
35 145 150 155 160 

Asp Phe Asn Arg Val Phe Leu He Gly Asp Ser Ser Gly Gly Asn He 
165 170 175 

Val His Gin Val Ala Val Lys Ala Gly Glu Glu Asn Leu Ser Pro Met 
40 1 80 1 85 1 90 

Arg Leu Ala Gly Ala He Pro He His Pro Gly Phe Val Arg Ser Tyr 
195 200 205 



45 



50 



55 



Arg Ser Lys Ser Glu Leu Glu Gin Glu Gin Thr Pro Phe Leu Thr Leu 
210 215 220 

Asp Met Val Asp Lys Phe Leu Gly Leu Ala Leu Pro Val Gly Ser Asn 
225 230 235 240 

Lys Asp His Gin He Thr Cys Pro Met Gly Glu Ala Ala Pro Ala Val 
245 250 255 

Glu Glu Leu Lys Leu Pro Pro Tyr Leu Tyr Cys Val Ala Glu Lys Asp 
260 265 270 

Leu He Lys Asp Thr Glu Met Glu Phe Tyr Glu Ala Met Lys Lys Gly 
275 280 285 

Glu Lys Asp Val Glu Leu Phe He Asn Asn Gly Val Gly His Ser Phe 
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290 295 300 

Tyr Leu Asn Lys lie Ala Val Arg Met Asp Pro Val Thr Gly Ser Glu 
3C5 310 315 320 

5 

Thr Glu Lys Leu Tyr Glu Ala Val Ala Glu Phe lie Asn Lys His 
325 330 335 

(2) INFORMATION FOR SEQ ID NO : 3: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
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25 



(ii) MOLECULE TYPE: genomic DNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTTGCCAAAA TGGTTCATGA AAAGCAAGTG ATAGAGGAAG TATCCGGCTG GCTTAGAGTT 60 
TTCGGGGTAG GTCAGTCCCT TATGTTACGT CCT ■ 93 



Claims 

30 1. A recombinant DNA sequence including a region comprising the nucleotide sequence depicted in SEQ 
ID No. 1 or a functional equivalent thereof, or a recombinant sequence comprising a part of said region 
or said equivalent. 

2. A recombinant DNA sequence including a region comprising nucleotides 1413 to 2417 of the sequence 
35 depicted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequence comprising a part 

of said region or said equivalent 

3. A recombinant DNA sequence including a region comprising nucleotides 1 to 1341 of the sequence de- 
picted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequence comprising a part 

^ of said region or said equivalent 

4. Arecombinant DNAsequence including a region comprising nucleotides 1 to 651 of the sequence depicted 
in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequence comprising a part of said 
region or said equivalent 

45 5. Arecombinant DNAsequence including a region comprising nucleotides 652 to 1341 of the sequence de- 
picted in SEQ ID No. 1 or a functional equivalent thereof, or a recombinant sequence comprising a part 
of said region or said equivalent 

6. A recombinant DNA sequence including a region comprising nucleotides 1195 to 1341 of the sequence 
50 depicted in SEQ ID No. 1 or a functional equivalent thereof, or recombinant sequence comprising a part 

of said region or said equivalent 

7. A recombinant DNA sequence including a r gion comprising nucleotides 1195 to 1268 of the sequence 
depicted in SEQ ID No. 1 or a functional quivalent thereof, or a recombinant sequence comprising a part 

55 of said region or said equivalent 

8. A recombinant DNA sequence comprising at least one region or part or equivalent thereof, according to 
any one of claims 3 to 7, wherein said region or part or equivalent is located on the 5' side of, and is op- 
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erably linked to, a protein-encoding sequence of a heterologous gene or to a sequence comprising nu- 
cleotides 1413 to 2417 of the sequence depicted in SEQ ID No. 1 or a functional equivalent thereof. 

9. A recombinant DNA sequence according to the preceding claim, wherein a translation enhancing se- 
quence is present between the region or part or equivalent thereof, and the protein-encoding region of 
the DNA sequence 3* thereto. 

10. A recombinant DNA sequence according to either of claims 8 or 9, wherein the heterologous gene is a 
selectable or screenable marker gene or a gene, the product of which is capable of conferring resistance 
or tolerance to at least one of the following: insects, herbicides, fungi, bacteria and viruses. 

11. A recombinant DNA sequence according to any preceding claim, wherein one or more nucleotides have 
been added to, removed from or substituted in the recombinant sequence without substantially affecting 
the function or amino acid encoding capability thereof. 

12. Recombinant DNA according to any one of claims 1 to 11 1 which is modified in that codons which are pre- 
ferred by the organism into which the recombinant DNA is to be inserted are used so that expression of 
the thus modified DNA in the said organism yields substantially similar protein to that obtained by expres- 
sion of the unmodified recombinant DNA in the organism in which the protein-encoding components of 
the recombinant DNA are endogenous. 

13. A DNA sequence which is complementary to one which, under stringent conditions, hybridizes to the re- 
combinant DNA sequence according to any one of claims 1 to 12. 

14. A DNA vector comprising a recombinant DNA sequence according to any one of claims 1 to 12, or a DNA 
sequence according to claim 13. 

1 5. Protein obtained by expression of the DNA according to any one of claims 1 to 1 3. 

16. Protein having the amino acid sequence depicted in SEQ ID No. 2, or a functional equivalent of said se- 
quence. 

17. A micro-organism or plant cell or protoplast which has been transformed with recombinant DNA according 
to any one of claims 1 to 11, or a DNA sequence according to claim 12. 

18. A plant, the genome of which comprises a vector according to claim 14, in which plant the recombinant 
DNA is expressed. 

19. A plant according to the preceding claim selected from the group consisting of: tomatoes, peppers, man- 
goes, peaches, apples, pears, strawberries, bananas, melons, canola, sunflower, tobacco, sugar beet, 
wheat, barley, rice, corn, cotton, potato, carrot, lettuce, cabbage and onion. 

20. The progeny or seeds of plants according to either of claims 1 8 or 1 9, and the seeds and progeny of said 
progeny. 
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